Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models ranging in scale from 2 billion to 27 billion parameters, delivers the best performance for its size and even offers competitive alternatives to models that are 2-3 times bigger.
The new AlphaFold model demonstrates substantially improved accuracy over many previous specialized tools: far greater accuracy for protein–ligand interactions compared with state-of-the-art docking tools, much higher accuracy for protein–nucleic acid interactions compared with nucleic-acid-specific predictors and substantially higher antibody–antigen prediction accuracy.
This work introduces Mixtral 8x7B, a Sparse Mixture of Experts (SMoE) language model that vastly outperforms Llama 2 70B on mathematics, code generation, and multilingual benchmarks and provides a model fine-tuned to follow instructions, Mixtral 8x7B - Instruct, that surpasses GPT-3.5 Turbo, Claude-2.1, Gemini Pro, and the Llama 2 70B chat model on human benchmarks.
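To make the Sparse Mixture of Experts idea above concrete, here is a minimal sketch of top-2 expert routing in the spirit of Mixtral's SMoE layers; the tensor shapes, loop structure, and expert modules are illustrative assumptions, not Mixtral's actual fused implementation.

    import torch
    import torch.nn.functional as F

    def smoe_forward(x, gate_weight, experts, top_k=2):
        # x: (tokens, d_model); gate_weight: (n_experts, d_model);
        # experts: list of callables mapping (tokens, d_model) -> (tokens, d_model).
        logits = x @ gate_weight.T                   # router scores per expert
        weights, idx = logits.topk(top_k, dim=-1)    # keep only the top-2 experts per token
        weights = F.softmax(weights, dim=-1)         # renormalize over the selected experts
        out = torch.zeros_like(x)
        for k in range(top_k):
            for e, expert in enumerate(experts):
                mask = idx[:, k] == e                # tokens whose k-th choice is expert e
                if mask.any():
                    out[mask] += weights[mask, k, None] * expert(x[mask])
        return out

Only the selected experts run on each token, which is how such models keep per-token compute close to that of a much smaller dense model despite their large total parameter count.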
This work improves existing noise sampling techniques for training rectified flow models by biasing them towards perceptually relevant scales and presents a novel transformer-based architecture for text-to-image generation that uses separate weights for the two modalities and enables a bidirectional flow of information between image and text tokens.
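As a rough illustration of biasing noise-level sampling rather than drawing it uniformly, the sketch below samples rectified-flow timesteps from a logit-normal distribution, which concentrates training on intermediate noise levels; the distribution choice, parameter values, and model signature are assumptions for illustration, not the paper's exact recipe.

    import torch

    def sample_timesteps(batch_size, loc=0.0, scale=1.0):
        # Logit-normal sampling: t = sigmoid(N(loc, scale)) clusters around 0.5
        # instead of being uniform on [0, 1].
        return torch.sigmoid(torch.randn(batch_size) * scale + loc)

    def rectified_flow_loss(model, x0, t):
        # Straight-line interpolation x_t = (1 - t) * x0 + t * noise,
        # with velocity target noise - x0 (the standard rectified-flow objective).
        noise = torch.randn_like(x0)
        t_ = t.view(-1, *([1] * (x0.dim() - 1)))     # broadcast t over non-batch dims
        x_t = (1 - t_) * x0 + t_ * noise
        target = noise - x0
        pred = model(x_t, t)                          # assumed signature: model(x_t, t)
        return ((pred - target) ** 2).mean()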
This System Card provides a detailed look at GPT-4o's capabilities, limitations, and safety evaluations across multiple categories, focusing on speech-to-speech while also evaluating text and image capabilities, and the measures the authors have implemented to ensure the model is safe and aligned.
OpenVLA, a 7B-parameter open-source VLA trained on a diverse collection of 970k real-world robot demonstrations, is introduced, and it is shown that OpenVLA can be effectively fine-tuned for new settings, with especially strong generalization results in multi-task environments involving multiple objects and strong language grounding abilities.
The introduction is organized in a unique didactic manner developed by the authors, starting from simpler concepts such as linear programming and single-point methods, and advancing from these to more difficult concepts such as optimality conditions for nonlinear optimization and set-oriented solution algorithms.
This work introduces Gemma, a family of lightweight, state-of-the-art open models built from the research and technology used to create Gemini models, and presents comprehensive evaluations of safety and responsibility aspects of the models, alongside a detailed description of model development.
A novel post-training recipe significantly improves math, chat, instruction-following, and multilingual abilities, making Gemma3-4B-IT competitive with Gemma2-27B-IT and Gemma3-27B-IT comparable to Gemini-1.5-Pro across benchmarks.
Recent improvements to Job Dispatcher are presented in overview, including its brand-new website and documentation, enhanced visualisations, improved job management, and a rising trend of user reliance on the service from low- and middle-income regions.
Two below-threshold surface code memories on Willow, a distance-7 code and a distance-5 code integrated with a real-time decoder, indicate device performance that, if scaled, could realize the operational requirements of large-scale fault-tolerant quantum algorithms.
The development of TRIPOD+AI is described, the expanded 27-item checklist is presented with a more detailed explanation of each reporting recommendation, and the TRIPOD+AI for Abstracts checklist is presented.
The SCARE 2025 guideline provides an up-to-date framework for surgical case reports in the era of AI and adds specific reporting criteria for AI to ensure that any use of artificial intelligence in a case report is clearly documented, explained, and discussed, including with respect to bias and ethics.
OLMo, a competitive, truly Open Language Model, is built to enable the scientific study of language models, and it is hoped this release will empower the open research community and inspire a new wave of innovation.
The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2, a large model that significantly outperforms other models of comparable size and makes the model weights available under an OpenRAIL license.
This work introduces Tulu 3, a family of fully-open state-of-the-art post-trained models, alongside its data, code, and training recipes, serving as a comprehensive guide for modern post-training techniques.
The RewardBench dataset is a collection of prompt-chosen-rejected trios spanning chat, reasoning, and safety, built to benchmark how reward models perform on challenging, structured, and out-of-distribution queries; the accompanying evaluation presents many findings on the propensity for refusals, reasoning limitations, and instruction-following shortcomings of various reward models, towards a better understanding of the RLHF process.
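The trio structure implies a simple accuracy metric: a reward model gets a trio right when it scores the chosen completion above the rejected one. The sketch below assumes a hypothetical reward_model(prompt, response) -> float interface; it is not RewardBench's actual evaluation harness.

    def pairwise_accuracy(trios, reward_model):
        # trios: iterable of dicts with "prompt", "chosen", and "rejected" strings.
        wins, total = 0, 0
        for trio in trios:
            chosen = reward_model(trio["prompt"], trio["chosen"])
            rejected = reward_model(trio["prompt"], trio["rejected"])
            wins += int(chosen > rejected)
            total += 1
        return wins / total if total else 0.0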
PaliGemma is an open Vision-Language Model based on the SigLIP-So400m vision encoder and the Gemma-2B language model, and it achieves strong performance on a wide variety of open-world tasks.
GPT-4 significantly outperforms both human test-takers and prior models, demonstrating a 26% increase over ChatGPT and beating humans in five of seven subject areas; these results document not just the rapid and remarkable advance of large language model performance generally, but also the potential for such models to support the delivery of legal services in society.
Molmo is presented, a new family of VLMs that are state-of-the-art in their class of openness, with a novel, highly detailed image caption dataset collected entirely from human annotators using speech-based descriptions.
This work facilitates low precision KV cache quantization by incorporating several novel methods, including Per-Channel Key Quantization, and develops custom CUDA kernels for KVQuant, enabling LLaMA-7B to be served with a context length of up to 1 million tokens on a single A100-80GB GPU and up to 10 million tokens on an 8-GPU system.
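Per-channel key quantization, roughly, means computing one quantization scale per key channel (feature dimension) rather than per cached token, because outliers in key activations tend to concentrate in a few channels. The sketch below is a simplified symmetric-integer version under that assumption; KVQuant's actual method includes further components (non-uniform quantization, outlier handling, fused CUDA kernels) not shown here.

    import torch

    def quantize_keys_per_channel(keys, n_bits=4):
        # keys: (num_cached_tokens, head_dim); one scale per channel (column).
        qmax = 2 ** (n_bits - 1) - 1
        scale = keys.abs().amax(dim=0, keepdim=True).clamp(min=1e-8) / qmax
        q = torch.round(keys / scale).clamp(-qmax - 1, qmax).to(torch.int8)
        return q, scale                               # int8 container for 4-bit values is illustrative

    def dequantize_keys(q, scale):
        return q.float() * scale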
The Cosmos World Foundation Model Platform is presented to help developers build customized world models for their Physical AI setups, and it positions a world foundation model as a general-purpose world model that can be fine-tuned into customized world models for downstream applications.
This work introduces SigLIP 2, a family of new multilingual vision-language encoders that build on the success of the original SigLIP and extend the original image-text training objective with several prior, independently developed techniques into a unified recipe.
Code development continues in line with the Galaxy Project roadmap, with improvements to job scheduling and the user interface, general-purpose graphics processing unit (GPGPU) access for cutting-edge methods, and licensed tool support.
An extensive evaluation of 60 LLMs shows that LLMs are not yet capable of following complex instructions to use function calls precisely, with scores up to 60%, significantly lower than the human performance of 97%, which underscores the need for further advancements in this area.
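A concrete way to picture how such function-calling scores are computed: each model response is checked against a reference function call for the expected name and arguments. The JSON shape, field names, and strict-equality matching below are illustrative assumptions, not the benchmark's actual grading code.

    import json

    def call_matches(model_output, reference):
        # reference: {"name": ..., "arguments": {...}}; model_output: raw JSON string.
        try:
            call = json.loads(model_output)
        except json.JSONDecodeError:
            return False
        return (call.get("name") == reference["name"]
                and call.get("arguments") == reference["arguments"])

    reference = {"name": "get_weather", "arguments": {"city": "Paris", "unit": "celsius"}}
    output = '{"name": "get_weather", "arguments": {"city": "Paris", "unit": "celsius"}}'
    assert call_matches(output, reference)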
To facilitate scientific research on language model pretraining, Dolma is curated and released, a three-trillion-token English corpus built from a diverse mixture of web content, scientific papers, code, public-domain books, social media, and encyclopedic materials.
This work introduces Aya, a massively multilingual generative language model that follows instructions in 101 languages, of which over 50% are considered lower-resourced, and introduces extensive new evaluation suites that broaden the state of the art for multilingual eval across 99 languages.
With an improved framework for model development and evaluation, a large language model is shown to provide answers to medical questions that are comparable to, or preferred over, those provided by human physicians.
The results show the significant potential of AI in personalizing learning, automating routine tasks, and providing access to knowledge, but also reveal serious risks of exacerbating social inequality and ethical dilemmas.
A new measure of firm-level AI investments is proposed, using a unique combination of worker resume and job postings datasets, which reveals a stark increase in AI investments across sectors.
A simple approach to joint named entity recognition and relation extraction is presented, and it is demonstrated how pretrained large language models can be fine-tuned to extract useful records of complex scientific knowledge.
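One common shape such an approach can take is prompting (or fine-tuning) a language model to emit structured entity and relation records that are then parsed back into tuples; the schema, prompt wording, and field names below are illustrative placeholders rather than the paper's exact format.

    import json

    def build_prompt(passage):
        return (
            "Extract named entities and relations from the passage as JSON with keys "
            '"entities" (list of {"text", "type"}) and '
            '"relations" (list of {"head", "relation", "tail"}).\n\n'
            "Passage: " + passage
        )

    def parse_records(completion):
        # Turn the model's JSON completion into (entities, relations) tuples;
        # fall back to empty lists if the output is not valid JSON.
        try:
            data = json.loads(completion)
        except json.JSONDecodeError:
            return [], []
        entities = [(e.get("text"), e.get("type")) for e in data.get("entities", [])]
        relations = [(r.get("head"), r.get("relation"), r.get("tail"))
                     for r in data.get("relations", [])]
        return entities, relations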
Compared to current editing models, which exhibit degradation in character consistency and stability across multiple turns, FLUX.1 Kontext is observed to show improved preservation of objects and characters, leading to greater robustness in iterative workflows.
The model, called CUT3R (Continuous Updating Transformer for 3D Reconstruction), captures rich priors of real-world scenes: not only can it predict accurate pointmaps from image observations, but it can also infer unseen regions of the scene by probing at virtual, unobserved views.
Blink, a new benchmark for multimodal large language models (LLMs) that focuses on core visual perception abilities not found in other evaluations, is introduced, in the hope of stimulating the community to help multimodal LLMs catch up with human-level visual perception.
Virchow is presented, the largest foundation model for computational pathology to date, and it is demonstrated that a large foundation model enables pan-cancer detection, achieving 0.95 specimen-level area under the receiver operating characteristic curve across nine common and seven rare cancers.
The performance of state-of-the-art dismantling techniques is compared, highlighting their optimal range of applicability for practical problems, and grounded approaches to designing robustness, identifying early-warning signals, and devising adaptive responses are discussed.
The status of InterPro is reported on, detailing new developments in the database, associated web interface and software, including the increased integration of structures predicted by AlphaFold and the enhanced description of protein families using artificial intelligence.
HLE is introduced, a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage, to inform research and policymaking upon a clear understanding of model capabilities.
The findings substantiate that the PO is a promising and competitive algorithm, surpassing some existing algorithms in the literature.
NextDenovo is presented, an efficient error correction and assembly tool for noisy long reads, which achieves a high level of accuracy in genome assembly and is applied to assemble 35 diverse human genomes from around the world using Nanopore long-read data.
Article Galaxy Pages is a free service from Research Solutions, a company that offers access to content in collaboration with publishing partners, online repositories and discovery services.